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Abstract 

We investigate whether kernelization results can be obtained if we restrict kernelization 
algorithms to run in logarithmic space. This restriction for kernelization is motivated by the 
question of what results are attainable for preprocessing via simple and/or local reduction 
rules. We find kernelizations for c?-hitting SEt(A:), d-SET PACKiNG(fc), edge dominating 
SET (A:) and a number of hitting and packing problems in graphs, each running in logspace. 
Additionally, we return to the question of linear-time kernelization. For d-HiTTiNG SEt(A:) a 
linear-time kernelization was given by van Bevern [Algorithmica (2014)]. We give a simpler 
procedure and save a large constant factor in the size bound. Furthermore, we show that 
we can obtain a linear-time kernel for d-SET PACKiNG(fc) as well. 


1 Introduction 

The notion of kernelization from parameterized complexity offers a framework in which it is 
possible to establish rigorous upper and lower bounds on the performance of polynomial-time 
preprocessing for NP-hard problems. Efficient preprocessing is appealing because one hopes 
to simplify and shrink input instances before running an exact exponential-time algorithm, 
approximation algorithm, or heuristic. A well-known example is that given an instance {G,k), 
asking whether graph G has a vertex cover of size at most k, we can efficiently compute an 
equivalent instance {G', k') where k' <k and G' has at most 2k vertices [2]. On the other hand, 
the output instance could still have edges and a result of Dell and van Melkebeek [4] 

indicates that this cannot be avoided unless NP C coNP/poly (and the polynomial hierarchy 
collapses). Many intricate techniques have been developed for the field of kernelization and 
some other variants have been considered. For example, the more relaxed notion of Turing 
kernelization asks whether a problem can be solved by a polynomial-time algorithm that is 
allowed to query an oracle for answers to instances of small size [13]. 

In this work we take a more restrictive view. When considering reduction rules for NP-hard 
problems that a human would come up with quickly, these would often be very simple and 
probably aimed at local structures in the input. Thus, the matching theoretical question would 
be whether we can also achieve nice kernelization results when restricted to “simple reduction 
rules.” This is of course a very vague statement and largely a matter of opinion. For local 
reduction rules this seems much easier: If we restrict a kernelization to running in logarithmic 
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space, then we can no longer perform “complicated” computations like, for example, running a 
linear program or even just finding a maximal matching in a graph. Indeed, for an instance x, 
the typical use of log |x| bits would rather be to store a counter with values up to or to 

remember a pointer to some position in x. 

The main focus of our work is to show that a bunch of classic kernelization results can also 
be made to work in logarithmic space. To the best of our knowledge such a kernelization was 
previously only known for VERTEX COVEr(/c) [1]. Concretely, we show that d-HiTTiNG SEt(/c), 
d-SET packing(A:), and EDGE DOMINATING SEt(A:) as well as a couple of implicit hitting set 
and set packing type problems on graphs admit polynomial kernels that can be computed in 
logarithmic space. The astute reader will instantly suspect that the well-known sunflower lemma 
will be behind this, but—being a bit fastidious—this is only partially true. 

It is well-known that so-called sunflowers are very useful for kernelization (they can be used 
to obtain polynomial kernels for, e.g., d-HiTTiNG SEt(/c) [7] and d-SET packing(A:) [3]). A 
A:-sunflower is a collection of k sets Ti,..., such that the pairwise intersection of any two sets 
is the same set C; called the core. The sets Fi\C,... ,Fk\C are therefore pairwise disjoint. 
When seeking a /c-hitting set S, the presence of a (/c-|-I)-sunflower implies that S must intersect 
the core, or else fail to hit at least one set Tj. The Sunflower Lemma of Erdos and Rado implies 
that any family with more than dlk'^ sets, each of size d, must contain a {k + l)-sunflower which 
can be efficiently found. Thus, so long as the instance is large enough, we will find a core C 
that can be safely added as a new constraint, and the sets Fi containing C may be discarded. 

Crucially, the only point of the disjoint sets Fi \ C,..., Ffc \ C is to certify that we need at 
least k elements to hit all sets Fi,..., F^, assuming we refuse to pick an element of C. What if 
we forgo the disjointness requirement and only request that not picking an element of C incurs 
a hitting cost of at least k (or at least k + 1 for the above illustration)? It turns out that the 
corresponding structure is well-known under the name of a flower: A set T" is a /c-flower with 
core C if the collection {F \C: F F, F D C} has minimum hitting set size at least k. Despite 
the seemingly complicated requirement, Hastad et al. [9] showed that any family with more than 
k'^ sets must contain a {k + l)-flower. Thus, by replacing sunflowers with flowers, we can save 
the extra d! factor in quite a few kernelizations with likely no increase to the running time. In 
order to meet the space requirements for our logspace kernelizations, we avoid explicitly finding 
flowers and instead use careful counting arguments to ensure that a (A; -|- l)-flower with core 
C C F exists when we discard a set F. 

Finally, we also return to the question of linear-time kernelization that was previously studied 
in, e.g., [14, 15]. Using flowers instead of sunflowers we can improve a linear-time kernelization 
for d-HiTTiNG SEt(/c) by van Bevern [15] from d! • • {k + 1)^ to just {k -|- 1)'^ sets (we also 

save the d^"*"^ factor because of the indirect way in which we use flowers). Similarly, we have 
a linear-time kernelization for d-SET pagking(/c) to {d{k — 1) -|- 1)“^ sets. We note that for 
linear-time kernelization the extra applications for hitting set and set packing type problems 
do not necessarily follow: In logarithmic space we can, for example, find all triangles in a 
graph and thus kernelize triangle-free vertex deletion(A:) and triangle pagking(/c). 
In linear time we will typically have no algorithm available that can extract the constraints 
respectively the feasible sets for the packing that are needed to apply a d-HiTTiNG SEt(A:) or 
d-SET pagking(/c) kernelization. 

We remark that the kernelizations for d-HiTTiNG SEt(/c) and d-SET pagking(/c) via repre¬ 
sentative sets (cf. [12]) give more savings in the kernel size. For d-HiTTiNG SEt(/c) this approach 
yields a kernel with at most (^^^) = = ^(A:-|- 1) •... • (A:-|-d) > ^ sets, thus saving at most 

another d! factor. It is however unclear if this approach can be made to work in logarithmic 
space or linear time. Applying the current fastest algorithm for computing a representative set 
due to Fomin et al. [8] gives us a running time of \F\d^ + jT"] ) where u is the 
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matrix multiplication exponent. 


Organization We will start with preliminaries in Section 2 and give a formal introduction on 
(sun)flowers in Section 3. We present our logspace kernelization results for d-HiTTiNG SEt(/c), 
d-SET packing(/c), and edge dominating SEt(A:) in Sections 4, 5, and 6 respectively. In 
Section 7 we describe how our logspace kernels for packing and hitting sets can be used in order 
to obtain logspace kernelizations for implicit hitting and packing problems on graphs. We show 
how our techniques can be used in conjunction with a data-structure and subroutine by van 
Bevern [15] in order to obtain a smaller linear-time kernel for d-HlTTlNG SEt(A:) in Section 8. 
This also extends to a linear-time kernel for d-SET pagking(A:) which we give in Section 9. 
Concluding remarks are given in Section 10. 

2 Preliminaries 

Set families and graphs. We use standard notation from graph theory and set theory. Let 
[/ be a finite set, let be a family of subsets of U, and let S C U. We say that S hits a set 
F G T” if S' n F 7 ^ 0. In slight abuse of notation we also say that S hits T if for every F G F it 
holds that S hits F. More formally, S is a hitting set (or blocking set) for F if for every F G F 
it holds that S'nF7^0. If|S|<A:, then S' is a Fhitting set. A family F C F is a packing 
if the sets in V are pairwise disjoint; if |F| = k, then V is called a /c-packing. In the context 
of instances {U,iF,k) for d-HiTTiNG SEt(A:) or d-SET packing(/c) we let n = \U\ and m = |F|. 
Similarly, for problems on graphs G = (V,E) we let n = |I7| and m = |F|. A restriction Tc of 
a family F onto a set C is the family {F \C :FgF, FDC}, i.e., it is obtained by only taking 
sets in F that are a superset of C and removing C from these sets. 

Parameterized complexity. A parameterized problem is a language Q C S* x N; the second 
component of instances (x, k) G S* x N is called the parameter. A parameterized problem 
Q C S* X N is fixed-parameter tractable if there is an algorithm that, on input (x, /c) G S* x N, 
correctly decides if (x,/c) G Q and runs in time 0(/(A:)|x|‘^) for some constant c and any 
computable function /. A kernelization algorithm (or kernel) for a parameterized problem 
Q C S* X N is an algorithm that, on input {x,k) G S* x N, outputs in time (|x| -|- k)^^^'> an 
equivalent instance {x',k') with |x^| k' < g{k) for some computable function 5 ^: N —N such 
that (x, k) G Q (x', k') G Q. Here g is called the size of the kernel; a polynomial kernel is a 
kernel with polynomial size. 

3 Sunflowers and flowers 

The notion of a sunflower has played a significant role in obtaining polynomial kernels for the 
d-HiTTiNG SEt(/c) and d-SET packing(/c) problems. We start with a formal definition. 

Definition 1. A sunflower with I petals and core C is a family F = {Fi,...,F} such that 
each Fi\C is non-empty and Fi n Fj = C for all iff. 

The prominent sunflower lemma by Erdos and Rado states that we are guaranteed to find a 
sunflower with sufficiently many petals in a d-uniform set family if this family is large enough. 

Lemma 1 (Erdos and Rado [5]). Let F be a family of sets each of cardinality d. If |F| > 
dl{l — 1)*^, then F contains a sunflower with I petals. 
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This lemma can be made algorithmic such that we can find a sunflower with I petals in a 
set family T" in time 0{\T\) if IJ^I > d\{l — 1)'^. Flum and Grohe [7] apply this result to obtain a 
polynomial kernel for d-HiTTiNG SEt(A:) by repeatedly finding a sunflower with k + 1 petals and 
replacing it by its core C. This operation preserves the status of the d-HiTTiNG SEt(/c) problem 
since any /c-hitting set S for F must hit all sets in the sunflower. Because the sunflower has 
at least k + 1 petals, S must contain an element of C. Alternatively, as used for example by 
Kratsch [11], one can instead look for sunflowers with at least k + 2 petals and discard sets such 
that k + 1 petals are preserved. The presence of these k + 1 petals in the reduced instance still 
forces any A:-hitting set S to hit the core C; thus S must hit the discarded sets as well. This 
has the advantage that, besides preserving all minimal solutions, a subset of the family given 
in the input is returned in the reduced instance. Kernels adhering to these properties preserve 
a lot of structural information and are formalized as being expressive by van Bevern [15]. 

Fellows et al. [6] give a polynomial kernel for d-SET pagking(A:). Dell and Marx [3] provide 
a self-contained proof for this result that uses sunflowers. Here the crucial observation is that 
any /c-packing of sets of size d can intersect with at most dk petals of a sunflower if it avoids 
intersection with the core (the same argument is implicit in a kernelization for problems from 
MAX NP in [11]). Each of the described kernelization algorithms returns instances of size 0{k'^). 
However, as a consequence of using the sunflower lemma, there is a hidden dl multiplicative 
factor in these size bounds. We avoid this by considering a relaxed form of sunflower, known as 
flower (cf. Jukna [10]) instead. 

Definition 2. An /-flower with core C is a family T such that any blocking set for the restrietion 
Fc of sets in F onto C eontains at least I elements. 

Note that every sunflower with I petals is also an /-flower but not vice-versa. From Def¬ 
inition 2 it follows that the relaxed condition for set disjointness outside of C is still enough 
to force a /c-hitting set S to contain an element of C if there is a {k + l)-flower with core C. 
Similar to Lemma 1, Hastad et al. [9] give an upper bound on the size of a set family that must 
contain an /-flower. The next lemma is a restatement of this result. We give a self-contained 
proof following Jukna’s book. 

Lemma 2 (cf. Jukna [10, Lemma 7.3]). Let F be a family of sets eaeh of cardinality d. If 
jT”] > (/ — l)'^, then F eontains an l-flower. 

Proof. We prove the lemma for any / by induction over 1 < d' < d. If d' = 1, then the lemma 
obviously holds since any / sets in F are pairwise disjoint and even form a sunflower with core 
C = lb. Let us assume that it holds for sets of size d' — 1; we prove that it holds for sets of size 
d' by contradiction. Suppose that there is no /-flower in F while jJ-'j > (/ — l)'^ • Let X be a 
minimal blocking set for F. We have that jXj < / — 1, otherwise F itself is an /-flower with 
core C = 0. Since X intersects with every set in F and jJ-"] > (/ — 1)*^ , there must be some 

/7 _ 1 1 

element x G X that intersects more than ^ = (/ — 1)“^ sets. Therefore, the restriction Fx 

contains more than (/ — l)'^^”^ sets, each of size d' — 1. By the induction hypothesis we find that 
Fx contains an /-flower with core C and obtain an /-flower in F with core C = C II {x}. □ 

The proof for Lemma 2 implies that we can find a flower in 0{\F\) time if jX] > (/ — l)'^ 
by recursion; Let X be a family of sets of size d in which we currently want to find an /-flower. 
Pick an element x such that Xr has more than (/ — 1)*^“^ sets; then find a flower in Fx and add 
X to its core. If no such x exists, then return X instead since any set of size / — 1 intersects with 
at most (/ — 1) • (/ — l)'^”^ = (^ “ l)'^ < l-^l sets in X, i.e., a blocking set for X requires at least / 
elements. However, in order to obtain our logspace and linear-time kernels we avoid explicitly 
finding a flower. Instead, by careful counting, we guarantee that a flower must exist with some 
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fixed core C if two conditions are met. To this end we use Lemma 3. Note that we no longer 
assume J- to be d-uniform but instead only require that any set in T is of size at most d, similar 
to the families that we consider in instances of d-HiTTiNG SEt(/c) and d-SET packing(/c). If 
the required conditions hold, we find that a family F either contains an Lflower with core C or 
the set C itself. Thus, any hitting set of size at most I — 1 for T" must contain an element of C. 
For our d-HiTTiNG SEt(A;) kernels we use the lemma with I = k + 1. 

Lemma 3. For a finite set U, eonstant d, and a set C G ^ {<d) ® family sueh 

that 

(1) there are at least supersets F D C in F and 

(2) there are at most supersets F' 'DC' in T for any other C D C, C ^ . 

Then T contains an I-flower with core C or C £ F. 

Proof. Let us consider the restriction Fc = {S \ C : S £ F, S D C} of sets in F onto C. If 
C £ F, then we are done. In the other case, let X be a blocking set for Fc, i.e., X n F 7 ^ 0 for 
all F £ Fc (by assumption C ^ F, thus 0 ^ Fc and a blocking set exists). For every element 
X £ X consider the number of sets in Fc that contain x; these correspond to the supersets of 
C' = C U {x} in F. We obtain from property ( 2 ) that there are at most such 

sets. Thus, \Fc\ < |X| • l<i-\C\-i since every set in Fc has a non-empty intersection with X 
while we have previously bounded the number of sets in Fc that contain at least one element 
of X. By property (1) of F we have that \Fc\ > Therefore |X| > I and consequently 

F' = {S : S £ F,S D C} is the desired Lflower. □ 

The astute reader may find that Lemma 3 is in a sense not completely tight. Indeed, if we 
require that there are more than (I — supersets of C (instead of at least that many) and 

at most {I — I supersets for bigger cores C, then an /-flower with core C must exist (if 

X hits Fc, then (/ — < \Fc\ < |X| • (/ — therefore |X| > / — 1). For technical 

convenience, the present formulation is more suitable for our algorithms. 

4 Logspace kernel for Hitting Set 


d-HiTTiNG SEt(A:) Parameter; k. 

Input: A set U and a family F of subsets of U each of size at most d, i.e., F C (^), and 
A: G N. 

Question: Is there a fc-hitting set S for F? 

In the following we present a logspace kernelization algorithm for d-HiTTiNG SEt(A:). The 
space requirement prevents the normal approach of finding sunflowers and modifying the family 
F in memory (we are basically left with the ability to have a constant amount of pointers and 
counters in memory). We start with an intuitive attempt for getting around the space restriction 
and show how it would fail. 

The intuitive (but wrong) approach at a logspace kernelization works as follows. Process 
the sets F £ F one at a time and output F unless we find that the subfamily of sets that where 
processed before F contains & {k -\- l)-flower that enforces some core C C F to be hit. For a 
single step t, let Ft be the sets that we have processed so far and let F't F Ft he the family 
of sets in the output. We would like to maintain that a set S' is a /c-hitting set for Ft if and 
only if it a /c-hitting set for F'f Now suppose that this holds and we want to show that our 
procedure preserves this property in step t + 1 when some set F is processed. This can only 
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fail if we decide to discard F, and only in the sense that some 5 is a fc-hitting set for but 
not for Ft+i because -Ty+i C Ft+i- However, S is also a /c-hitting set for F[ C and, by 
assumption, also for Ft- Recall that we have discarded F because of a (/c + l)-flower in Ft with 
core C, so S must intersect C (or fail to be a /c-hitting set). Thus, S intersects also F D C, 
making it a /c-hitting set for Ft U {F} = Ft+i- Unfortunately, while the correctness proof would 
be this easy, such a straightforward approach fails as a consequence of the following lemma. ^ 

Lemma 4. Given a family of sets F of size d and F £ F. The problem of finding an I-flower 
in F\F with core C F F is coNP-hard. 

Proof. We give a reduction from the coNP-hard not (/-hitting set problem which answers yes 
for an instance if it does not have a hitting set of size at most k. Given an instance {U, F, k), we 
add a set F that is disjoint from any set in F and ask if F contains a (A: -|- l)-flower with core 
OFF. We show that this is the case if and only if F does not have a fc-hitting set. Suppose 
that there is a (/c -I- l)-flower with core C F F. By construction, C = tti since F is completely 
disjoint from F. Thus, F must contain a (/c-|- l)-flower with an empty core, i.e., the hitting set 
size for F = F^rs at least k + 1. For the converse direction suppose that there is no hitting set 
of size k for F, i.e., a hitting set for F requires at least k + 1 elements. Consequently, T” is a 
{k -\- l)-flower with core C = $ F F. □ 

Even if we know that the number of sets that where processed exceed the bound of 
Lemma 2, finding out whether there is a flower with core C F F is hard.^ Instead we use an 
application of Lemma 3 that only ensures that there is some flower with C F F if two stronger 
counting conditions are met. Whether condition (1) holds in Ft can be easily checked, but 
there is no guarantee that condition (2) holds if Ft exceeds a certain size bound, i.e., this does 
not give any guarantee on the size of the output if we process the input once. We fix this by 
taking a layered approach in which we filter out redundant sets for which there exist flowers 
that ensure that these sets must be hit, such that in each subsequent layer the size of the cores 
of these flowers decreases. 

We consider a collection of logspace algorithms Hq) • • • > and families of sets T'(O),... , T'((/) 
where F{1) is the output of algorithm Ai. Each of these algorithms simulates the next algorithm 
for decision-making, i.e., if we directly run Ai, then it simulates Hz+i which in turn simulates 
y 4 ;+ 2 , etc. If we run Ai however, then it is the only algorithm that outputs sets; each of the 
algorithms that are being simulated as a result of running Ai does not produce output. 

We maintain the invariant that for all C C /7 such that I < \C\ < d, the family F{1) contains 
at most {k -\- supersets of C. Each algorithm Ai processes sets in F one at a time. Eor a 

single step /, let Ft be the sets that have been processed so far and let Ft{l) denote the sets in 
the output. Note that Ai will not have access to its own output Ft{l) but we use it for analysis. 

Let us first describe how algorithm processes set F in step / -|- 1. If F ^ Ft, then 
Ad decides to output F; in the other case it proceeds with the next step. In other words 
Ad is a simple algorithm that outputs a single copy of every set in F. This ensures that the 
kernelization is robust for hitting set instances where multiple copies of a single set appear. If 
we are guaranteed that this is not the case, then simply outputting each set F suffices. Clearly 
the invariant holds for F{d) since any C F only has F = C as a superset and there is at 
most {k + 1)^ = 1 copy of each set in F{d). 

^It is well known that finding a fc-sunflower is NP-hard in general. Similarly, finding a fc-fiower is coNP-hard. 
For self-contained proofs see Appendix A. Both proofs do not apply when the size of the set family exceeds the 
bounds in the (sun)fiower lemma. 

^Note that we would run into the same obstacle if we use sunflowers instead of flowers; finding out whether 
there exists a (fc -P l)-sunfiower with core C C F for a specific set F is NP-hard as we show in Appendix A.3 
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Algorithm 1: Step t + 1 oi Ai, 0 < I < d. 


1 

2 

3 

4 

5 

6 

7 

8 
9 


simulate up to step t + 1; 
if A /+1 decides not to output F then 
I do not output F and end the computation for step t + 1; 
else 

for each C C F with \C\ = I do 
simulate A/+i up to step t; 

count the number of supersets of C that would output; 
if the result is at least {k + then 

1^ do not output F and end the computation for step t + 1; 
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output F; 


For 0 < I < d the procedure in Algorithm 1 describes how Ai processes F in step t + 1. First 
observe that lines 2 and 3 ensure that F{1) C F{1 + 1). Assuming that the invariant holds for 
F{1 + 1),... ,F{d), lines 8 and 9 ensure that the invariant is maintained for F{1). Crucially, we 
only need make sure that it additionally holds for C G since larger cores are covered by the 
invariant for F{1 + 1 ). 

Observation 1. Aq commits at most {k + 1)'^ sets to the output during the computation. This 
follows from the invariant for F{0) when considering (7 = 0 which is in (>;) for 1 = 0. 

Lemma 5. For 0 <l < d, Ai can he implemented such that it uses logarithmic space and runs 
in 0(1 time. 

Proof. Upon running Ai, at most d — I algorithms (one instance of each A;_|_i,..., Ad) are being 
simulated at any given time, i.e., when we run A; at most d — I + 1 algorithms actively require 
space for computation. In order to iterate over a family F, a counter can be used to track 
progress, using log | bits. Each algorithm can use such a counter to keep track of its current 
step. Let us assume that the elements in U are represented as integers {!,...,|C/|}. This enables 
us to iterate over sets C C F using a counter which takes dlog \ U\ bits of space. Set comparison 
and verifying containment of a set (7 in a set in F can be done using constant space since 
the sets to be considered have cardinality at most d. Finally, each algorithm requires at most 
log((A: + 1)'^) = dlog{k + 1) bits to count these sets, where k < \U\ in any non-trivial instance. 

Let us now analyze the running time. Clearly, Ad runs in time 0{\F'\^). There are at most 
2^ subsets (7 C F for any set F of size d. Thus, for / < i < d, each Ai consults Aj+i a total of 
0{\F\) times during its computation (at most 1 + 2^ times in each step). All other operations 
take constant time, thus Ai runs in time 0{\F\'^~^ ■ \F\^) = 0{\F\'^~^~^‘^). □ 

Let us remark that we could also store each (7 C F in line 5, allocate a counter for each of 
these sets, and simulate only once instead of starting a new simulation for each subset. 
This gives us a constant factor trade-off in running time versus space complexity. One might 
also consider a hybrid approach, e.g., by checking x subsets of F at a time. 

We will now proceed with a proof of correctness by showing that the answer to d-HiTTiNG 
SEt(/c) is preserved in each layer 7^(0) C F(l) C ... C F{d). 

Lemma 6 . Let 0 < I < d and let S be a set of size at most k. It holds that S is a hitting set 
for F{1) if and only if it is a hitting set for F{1 + 1). 
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Proof. For each 0 < I < d, we prove by induction over 0 < t < m that a set 5 is a A:-hitting 
set for if and only if it is a fc-hitting set for + !)• This proves the lemma since 

•Tm(0 = •T(/) and Fm{l + 1) = •F'(/ + 1). For t = 0 we have To(0 = •^o(^ + 1) = 0 and the 
statement obviously holds. Let us assume that it holds for steps t < i and prove that it also 
holds for step i + 1. One direction is trivial; If there is a /c-hitting set S for Fi+i{l + 1), then S 
is also a hitting set for Fi+i{l) C + 1). 

For the converse direction let us suppose that a set 5 is a fc-hitting set for We must 

show that S is also a hitting set for Fi+i{l +1). Let F be the set that is processed in step i + 1 
of Ai. If Ai decides to output F, then we have Fi+i{l) = Fi{l)U{F}, i.e., S must hit Fi{l)[J{F} 
and thus by the induction hypothesis it must also hit Fi{l + 1) U {F} = Fj+i(/ + 1). 

Now suppose that Ai does not output F, i.e., F ^ Fi+i{l). First let us consider the easy case 
where Ai decides not to output F because A+i decided not to output F. Thus, F ^ Fj+i(/ +1) 
and Fi+i{l + 1) = Fi{l + 1). A /c-hitting set S for Fj+i(/) hits Fi{l) and by the induction 
hypothesis it must hit Fi{l + 1); we have that S also hits Fi+i{l -|- 1) = Fi{l + 1). 

In the other case we know that Ai decides not to output F because it established that there 
are at least {k + supersets of some C F F with ICI = I in Fi{l + 1). Furthermore, by 

the invariant for F{1 + 1) we have that for all sets C that are larger than C there are at most 
{k + lY~\C\ supersets F' C C' in Fi{l -|- 1 ) F F{1 + 1). Consequently, by Lemma 3 we have 
that Fi{l + 1) contains C or a (/c -|- l)-flower with core C. Thus, in the first case any hitting set 
for Fi{l + 1) must hit C, and in the second case any hitting set for Fi{l -|- 1) requires at least 
k + 1 elements if it avoids hitting C; in other words, a hitting set of size at most k must hit 
C. Since S is a /c-hitting set for Fi+i{l) it must also hit Fi{l) C Fi+i{l) and by the induction 
hypothesis we have that S is also a /c-hitting set for Fi{l + 1). We have just established that S 
must hit C in order to hit Fi{l -|- 1) since it has cardinality at most k. Thus, S also hits FAC 
and therefore S' is a hitting set for Fi+i{l -|- 1) = Fj(/ -|- 1) U {F}. □ 

It is easy to see that a set 5 is a /c-hitting set for F{d) if and only if it is a hitting set for F, 
because A^ only discards duplicate sets. As a consequence of Lemma 6 , a set S is a /c-hitting 
set for F(0) if and only if it is a /c-hitting set for F{d). Therefore, it follows from Observation 1 
and Lemma 5 that Aq is a logspace kernelization algorithm for Fhitting SEt(/c). 

Theorem 1. cZ-hitting SEt(/c) admits a logspace kernelization that runs in time 0{\F\^~^‘^) 
and returns an equivalent instance with at most {k + 1)“^ sets. 

This kernelization is expressive; indeed, a subset of the input family is returned in the 
reduced instance and all minimal solutions up to size at most k are preserved (the latter is 
consequence of any set S of size k being a hitting set for F(0) if and only if 5 is a hitting set 
for F). Let us remark that technically we still have to reduce the ground set to size polynomial 
in k. We can reduce the ground set of the output instance to at most d{k + 1)'^ elements by 
including one more layer. Let A^+i be an algorithm that simulates A^. Each time that Ad 
decides to output a set F, algorithm Ad+i determines the new identifier of each element e in 
F by counting the number of distinct elements that have been output by Ad before the first 
occurrence of e. This can be done by simulating Ad up to first step in which Ad outputs e by 
incrementing a counter each time an element is output for the first time (whether an element 
occurs for the first time can again be verified via simulation of Ad). We can take the same 
approach for the other logspace kernelizations given in this paper (either for ground sets or 
vertices). 



5 Logspace kernel for Set Packing 


d-SET packing(A:) Parameter: k. 

Input: A set U and a family T of subsets of U each of size at most d, i.e., J- C (<^), and 

ken. 

Question: Is there a /c-packing V C 

In this section we present a logspace kernelization algorithm for d-SET packing(A:). The 
strategy for obtaining such a kernelization is similar to that in Section 4. However, the correct¬ 
ness proof gets more complicated. We point out the main differences. 

We consider a collection of logspace algorithms Bq, ... ,Bii that perform almost the same 
steps as the collection of algorithms described in the logspace kernelization for d-HiTTiNG SEt(/c) 
such that only the invariant differs. For each 0 < / < d we maintain that for all C C U such 
that / < ICI < d, the family B{1) that is produced by Bi contains at most {d{k — 1 ) -|- 
supersets of C. 

Observation 2. Bq commits at most {d{k — 1) +1)'^ sets to the output during the computation. 
This follows from the invariant for iF{0) when considering C* = 0 which is in [^i) for I = 0. 

Analogous to Lemma 5 we obtain the following. 

Lemma 7. For 0 < I < d, Bi can be implemented such that it uses logarithmic space and runs 
in 0(1 time. 

The strategy for the proof of correctness is similar to that of Lemma 6. However, we need 
a slightly stronger induction hypothesis to account for the behavior of a solution for d-SET 
pagking(A;) since it is a subset of the considered family. 

Lemma 8. For 0 < I < d, it holds that F{1) contains a packing V of size k if and only if 
F{1 + 1) contains a packing V' of size at most k. 

Proof. For each 0 < Z < d, we prove by induction over 0 < t < m that for any 0 < j < k and 
any set S C ^t{l) contains a packing V of size j such that S does not intersect with 

any set in V if and only if Ft{l + 1) contains a packing V' of size j such that S does not intersect 
with any set in V'. This proves the lemma since Fm{l) = d-'(Z), Fm{l + 1) = F{1 + 1), and for 
packings of size j = k the set S is empty. It trivially holds for any f if j = 0; hence we assume 
0 < j < k. For t = 0 we have ^^(0 = + 1) = 0 and the statement obviously holds. Let us 

assume that it holds for steps t <i and consider step i -|- 1 in which F is processed. If 
contains a packing V of size j, then Fj_|_i(Z -|- 1 ) also contains V since dT+i(Z + 1 ) 2 
Thus, the status for avoiding intersection with any set S remains the same. 

For the converse direction let us assume that Fi+i{l + 1) contains a packing V of size j 
that avoids intersection with a set S of size d{k — j). We must show that Fi+i{l) also contains 
a j-packing that avoids S. Let F be the set that is processed in step i -|- 1 of H;. Suppose 
that F ^V. Then Fi{l + 1) already contains V and by the induction hypothesis we have that 
Fi{l) F Fi+i{l) contains a j-packing V' that avoids S. 

In the other case F e V. Suppose that F G J'j+i(Z), i.e., Bi decided to output F. We 
know that Fi{l + I) contains the (j — l)-packing V \ {F} which avoids SU F. By the induction 
hypothesis we have that Fi{l) contains a packing V” of size j — 1 that avoids S F F. Thus, 
Fj+i(Z) contains the j-packing V' = V” U {F} which avoids S. 

Now suppose that F ^ Fi+i{l). By assumption, F e V F Fi+i{l + I). This implies that 
Bi decided not to output F because it has established that there are at least d{k — 1) + 
supersets of some C F F with \C\ = I va. Fi{l + 1). Furthermore, by the invariant for F{1 + 1) we 
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have that for all sets C that are larger than C there are at most {d{k — 1) + supersets 

F' D C' in Fiil + 1 ) C F(l + 1). Consequently, by Lemma 3 we have that Ti{l + 1) contains C 
or a {d{k — 1) + l)-flower with core C. In the first case, any hitting set for Fi{l + 1) must hit 
C, and in the second case any hitting set for Fi{l + 1) requires at least d{k — 1) + 1 elements if 
it avoids hitting C; thus any hitting set of size at most d{k — 1) must hit C. By assumption, 
V \ {F} and S both avoid C C F and therefore they both avoid at least one set F' in Fi{l + 1) 
since together they contain at most d{k — 1) elements. Thus we can obtain a j-packing V" in 
Fi+i{l + 1) that also avoids S by replacing F with F'. Since V" no longer contains F we find 
that Fi{l + 1) contains V" and by the induction hypothesis there is some packing V' of size j 
in Fi{l) C Fi+i{l) that avoids S. □ 

Since only discards duplicate sets it holds that F{d) has a fc-packing if and only if F 
has a /c-packing. As a consequence of Lemma 8 we have that F(0) has a /c-packing if and only 
if F{d) has a /c-packing. Therefore, it follows from Observation 2 and Lemma 7 that Bq is a 
logspace kernelization algorithm for d-SET packing(/c). 

Theorem 2. d-SET packing(/c) admits a logspace kernelization that runs in time 0{\F\^^'^) 
and returns an equivalent instance with at most {d{k — 1 ) -|- l)'^ sets. 

6 Logspace kernel for Edge Dominating Set 


EDGE DOMINATING SEt(A:) Parameter: k. 

Input: A graph G{V,E) and /c G N. 

Question: Is there a set S C F of at most k edges such that every edge in F \ S' is incident 
with an edge in SI 

Our strategy for obtaining a logspace kernelization algorithm for edge dominating SEt(/c) 
is as follows. First observe that the vertices of an edge dominating set of size at most k are 
a vertex cover of size at most 2k. This is frequently used in algorithms for this problem. 
Accordingly, we run our kernelization from Section 4 for the case of vertex cover (d = 2) with 
parameter 2k, obtaining an equivalent instance G' in which all minimal vertex covers of size at 
most 2k are preserved and proceed to add all edges between vertices of G'. 

We use some simulation to carry this out in logspace. Let R^c denote our logspace kerneliza¬ 
tion for d-HiTTiNG SEt(/c) with d = 2 and parameter 2k. The logspace kernelization algorithm 
Feds for EDGE DOMINATING SEt(/c) proceeds as follows (see Algorithm 2). Count the number 
of vertices with degree at least 2k+ \ that Fyc would output via simulation. If this is more than 
2k, then return a NO instance. Otherwise, output any edge between vertices that Fyc would 
output, again via simulation. Let us now give an upper bound on the number of edges that 
Feds will output. 

Lemma 9. Feds commits 0{k^) edges to the output during computation. 

Proof. First observe that Fyo would output a set of edges E" of size at most {2k + l)‘^ = 4A:^ -I-2A: 
(the number of edges that the logspace kernel for d-HiTTiNG SEt(/c) with d = 2 and parameter 
2k would output). Let H <ZV {E") denote vertices with degree at least 2A:-|-1, and let F C F(F") 
denote vertices with degree between 1 and 2k. Assume that Feds does not output a NO instance. 
We have that |F| < 2k and \L\ = 0{k‘^). Feds outputs all edges between vertices in H, all 
edges between vertices in L and all edges between H and L. Let us first bound the number 
of edges between vertices in L. If Fyc would not output an edge e, then this is because it has 
determined that there is a {k + l)-flower with an endpoint of e as its core (in other words, there 
are at least k + 1 other edges incident with the same vertex). By assumption Feds does not 
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output a NO instance, therefore the case where an edge is discarded because there is a flower 
with empty core does not apply. Thus, any edge that i?vc would not output has at least one 
endpoint with degree A: + 1. Therefore, i?vc would output all edges between vertices in L and 
we have that there are at most \E''\ < 4/c^ + 2k edges between vertices in L. There are 0{k‘^) 
edges between vertices in H and a further 0{k ■ k'^) = 0{k^) edges between H and L. Thus, 
i?EDs outputs 0{k^) edges, as claimed. □ 

Let us now show that i?EDs meets the time and space requirements for a logspace kernel. 

Lemma 10. i?EDs can he implemented such that it uses logarithmic space and runs in 0(|£^|^) 
time. 

Proof. Analogous to Lemma 5 we have that i?vc can be simulated in logspace and runs in time 
0{\E\^) (we assume that G is a simple graph; this saves a factor \E\ in the running time). It 
takes 0(|yI • |£^P) time and logarithmic space to execute Lines 1 to 5: Besides the space reserved 
for simulating i?vc) keep one counter to iterate over vertices in V and another for counting the 
number of high degree vertices; both of these counters require at most log \V\ bits. Executing 
Lines 6 to 7 clearly takes constant time and logspace. For Lines 8 to 12, reserve some memory 
to simulate i?vc) two more bits in order to track if i?vc outputs any edges incident to tt or u, 
and use log \E\ bits for a counter to iterate over edges in E. This takes OdEj • \E\^) = 0(|Ed) 
time, i.e., the total running time is 0{\V\ ■ \E\^ + \E\‘^) = 0{\E\‘^). □ 

We proceed with a proof of correctness. 

Lemma 11. Let G = {V,E) be the input graph for which Reds outputs a graph G' = {V',E'). 
R holds that G has an edge dominating set S of size at most k if and only if G' has an edge 
dominating set S' of size at most k. 

Proof. Suppose that S is an edge dominating set of size at most k for G'. Therefore, E(5) is a 
vertex cover of size at most 2k for G'. Let G" = iV', E") be the subgraph of G' = {V', E') that 
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Rvc would return. Now V{S) is a vertex cover of size at most 2k for G" and by Lemma 6 we 
have that V{S) is a 2/c-vertex cover for G. Therefore, S is an edge dominating set for G since 
the endpoints of edges in S cover all edges in E. 

For the converse, suppose that S is an edge dominating set of size at most k for G. We 
know that V{S) is a vertex cover of size at most 2k for G and therefore also for G' = {V',E') 
since E' C E. Now it could be the case that S has some edges that only have one endpoint in 
V. Thus we will obtain an edge dominating set S' of size at most k as follows, starting with 
S' = 0. We consider each edge e = {u, u} G S. li u,v G V', then e G E'] we add e to S' in order 
to cover edges that are incident with e. Otherwise, w.l.o.g., we have u G V',v ^ V', i.e., there 
are no edges in E' that are incident with v. Then e can be replaced by any other edge e' that 
is incident with u; we add e' to S'. □ 

The following theorem is a consequence of Lemmas 9 through 11. 

Theorem 3. edge dominating SEt(A:) admits a logspace kernelization that runs in time 
0{\E\‘^) and returns an equivalent instance with 0{k^) edges. 

7 Logspace kernelization for hitting and packing constant sized 
subgraphs 


?^-FREE VERTEX deletion(/c) Parameter: k. 

Input: A graph G{V,E), and /c G N. 

Question: Is there a set S of at most k vertices of G such that G\y \ 5] does not have any 

H gT-L as an induced subgraph? 

In this section we consider some hitting and packing problems on graphs. We will start with 
a logspace kernelization algorithm for ?^-free vertex deletion (A:) where we assume that for 
some constant d we have \V{H)\ < d for all H Gki. We first describe a ’sloppy’ version of this 
kernel for which we allow parallel edges to appear in the output graph. This algorithm, denoted 
as Rq, is an adaptation of the logspace kernelization for d-HiTTiNG SEt(/c) in which we let V 
play the role of the finite set U. The family E consists of those sets of vertices corresponding to 
the occurrences of graphs of R in G, i.e., a hitting set for is a solution for ?^-free vertex 
deletion(/c). 

Note that we must iterate over sets in E in order to run the d-HiTTiNG SEt(A:) logspace 
kernelization while E is not given explicitly in the input. In order to do this the algorithm 
reserves dlog n bits of space. This allows it to iterate over all subsets of vertices with cardinality 
at most d. It then identifies that such a set is in E if its induced graph coincides with a forbidden 
graph in R, which can be verified in constant space. 

Only two such iterators are required, one for step by step processing of sets in E, and 
another to obtain the count that is used to decide whether or not a set F G should appear 
in the output. When Rq does decide to output F, we simply output all edges in G[F]. This is 
where parallel edges can appear since the sets of edges that are committed to the output may 
partially overlap. Using Rq as a building block, we now present the proper logspace kernelization 
algorithm Ri for ^-FREE VERTEX deletion(/c) (Algorithm 3). 

Observation 3. Ri commits at most output during the computation. 

This follows from Observation 1, where each set corresponds to a graph with at most d vertices 
and edges. 

It is easy to see that Ri resolves the issue with parallel edges and is executable in logspace. 
For running time analysis, note that family E has size The running time for Rq 


12 






Algorithm 3: Ri: Logspace kernel for ?^-free vertex deletion (A:). 

1 . foreach e G -E do 

2 simulate Rq] 

3 if Rq decides to output e for the first time then 

4 output e; 

5 halt the simulation of Rq] 

is since we do not have multiple copies of sets in R. Therefore Ri runs in time 

0{\E\ ■ ). Consequently, we give the following theorem. 

Theorem 4. "H-free vertex deletion(/c) admits a logspace kernelization that runs in time 

0(\E\ ■ ) and outputs an equivalent instance with at most *^^^2 ‘ edges, where 

d is the maximum number of vertices of any H € R. 

A similar adaptation of the d-SET packing(A:) logspace kernelization algorithm yields a 
logspace kernel for the problem of finding a size k disjoint union of occurrences of a constant 
sized graph H in a host graph G, e.g., Triangle Packing(/c). 

Theorem 5. 4r-PAGKiNG(/c) admits a logspace kernelization that runs in time 0{\E\ ■ ) 

and outputs an equivalent instance with at most ■ (^d{k — 1) +1)*^ edges, where d = \V{E[)\. 

8 Linear-time kernel for Hitting Set 

We will now present a kernelization for d-HlTTlNG SEt(A:) that runs in linear time. The algo¬ 
rithm processes sets in E one by one and decides whether they should appear in the final output 
or not. For a single step t, let Et denote the sets that have been processed by the algorithm 
so far and let Jy C Et be the sets stored in memory for which it has decided positively. The 
algorithm also uses a data structure in which it can store the number of supersets of a set 
C G [Yd) stored in Efi Let supersets[C] denote the entry for set C in this data 

structure. We begin by initializing supersets[C'] ^ 0 for each CEE where E £ E. 

We maintain the invariant that the number of sets F £ E[ that contain C £ is at most 
{k + Now assume that the invariant holds after processing sets Et E F and let F be 

the next set. Algorithm 4 describes how the algorithm processes F in step t -|- 1. 

Algorithm 4; Step t -|- 1 of the linear-time kernel for d-HiTTiNG SEt(A:). 

1 foreach C* C F do 

2 query supersets[C] in order to determine the number of supersets of C in Efi 

3 if the result is at least {k + then 

4 1^ do not store F and end the computation for step t -|- 1; 

5 store F; 

6 foreach C C F do 

7 1^ supersets[C'] ^ supersets[C']-|-l; 

Note that Line 3 and 4 ensure that the invariant is maintained. After all sets in E have 
been processed, the algorithm returns the family E' E E oi sets that it has stored in memory. 


13 








Observation 4. After any step t, the algorithm has stored at most {k + 1)'^ sets in T[. This 
follows from the invariant when eonsidering (7 = 0. 

Lemma 12. The algorithm can he implemented such that it runs in linear time. 

Proof. Before we run the algorithm, we sort the sets in iF using the linear-time procedure 
described in [15]. Implementing the data structure as a trie [15] allows us to query and update 
supersets[C'] in time 0{\C\) for a set (7 where \C\ < d. We perform both of these operations 
at most 2'^ times in each step (this follows from the number of sets C C F that we consider). 
Therefore we spend only constant time for each set that is processed and hence the algorithm 
runs in time 0{\F\). □ 

We obtain the following lemma in a similar way to Lemma 6. The key difference is that we 
can now check if there are at least {k + supersets of a set C in Ft by using the data 

structure. Crucially, the algorithm has direct access to the previously processed sets that it 
plans to output; this simplifies the proof. 

Lemma 13. When a family F F F has been processed, the algorithm has a family Jy C Ft 
such that for any set S of size at most k, S is a hitting set for Ft if and only if S is a hitting 
set for F't. 

Proof. We prove the lemma by induction and show for each step t that a set S of size k is a 
hitting set for Jq if and only if S' is a hitting set for Ft. For t = 0 we have F^ = Fq = it) and 
the statement obviously holds. Now let us assume that it holds for steps t < i and prove that 
it also holds for step i + 1. One direction is trivial: If a set 5 is a A:-hitting set for then it 

is also a hitting set for T J7+i- 

For the converse direction let us suppose that a set 5 is a A:-hitting set for 77^^. We must 
show that S is also a hitting set for 77+i- Let F be the set that is processed in step i + 1. We 
must consider the case where the algorithm decides not to output F. Otherwise a hitting set for 
F'tj^i must hit F[ U {F} and thus by the induction hypothesis it must also hit Ft U {F} = J7+i- 
Now let us assume F ^ This implies that the algorithm decided not to output F 

because for some (7 C T it has determined that there are at least (k -|- supersets of F in 

F[. Furthermore, by the invariant we have that there are at most {k + l)‘^~^^ I supersets of C in 
F[ for any C' that is larger than (7. Consequently, by Lemma 3 we have that F[ contains (7 or a 
(/c-|-1)-flower with core (7, i.e., any hitting set of size k for 77 must hit (7 and therefore also hits 
F D C. Let 5 be a /c-hitting set for 77^^ and note that S is also a /c-hitting set for Tq C 77_|_j^. 
By the induction hypothesis, S is also a hitting set for 77. We have just established that S 
must hit (7 in order to hit Tq. Thus, S hits F and is a hitting set for 77+i = 77 U {F}. □ 

Reducing the size of the ground set is simple since we have the equivalent instance in memory. 
In one pass we compute a mapping of elements occurring in the equivalent instance to d{k + 1)*^ 
distinct identifiers. The following theorem is a consequence of Observation 4, Lemma 12 and 
Lemma 13. 

Theorem 6. d-HiTTiNG SEt(/c) admits a linear-time kernelization which returns an equivalent 
instance with at most {k + l)'^ sets. 

9 Linear-time kernel for Set Packing 

In the following we will present a linear-time kernelization for d-SET PACKING(A:). The algorithm 
performs almost the same steps as the linear-time kernelization for d-HiTTiNG SEt(A:) such that 
only the invariant differs. In this case we maintain that the number of sets F £ F^ that contain 
(7 G is at most {d{k — 1) + We point out the main differences. 
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Observation 5. After any step t, the algorithm has stored at most {d{k — 1) + sets in 
This follows from the invariant when considering C* = 0. 

Analogous to Lemma 12 we obtain the following. 

Lemma 14. The algorithm can be implemented such that it runs in linear time. 

Let us proceed with a proof of correctness. 

Lemma 15. When a family Tt TT has been processed, the algorithm has a set C Ft such 
that Ft contains a packing V of size k if and only if Ft contains a packing V' of size k. 

Proof. We prove the lemma by induction and show for each step t that for 0 < j < k and any 
set S C F't contains a packing V of size j such that S does not intersect with any 

set in V if and only if Ft contains a packing V' of size j such that S does not intersect with 
any set in V. This implies the lemma statement since for packings of size j = k the set S is 
empty. It trivially holds for any t if j = 0; hence we assume that 0 < j < k. For t = 0 we have 
T'q = To = 0 and the statement obviously holds. Now let us assume that it holds for steps t < i 
and prove that it also holds for step i + 1. If contains a packing V of size j, then 
also contains V since Ft+i D Thus, the status for avoiding intersection with any set S 

remains the same. 

For the converse direction let us assume that contains a packing V of size j that avoids 
intersection with a set S of size d{k — j). We must show that also contains a j-packing 
that avoids S. Let F be the set that is processed in step i + 1. Suppose that F fiV. Then Ft 
contains V and by the induction hypothesis we have that F[ C contains a packing V' of 
size f that avoids S. 

In the other case F £ V. Suppose that F £ We know that Ft contains the packing 

F \ {F} which avoids S U F. By the induction hypothesis we have that F' contains a packing 
V" of size j — 1 that avoids S U F. Hence, Tl+i contains the j-packing F' = F" U {F} which 
avoids S. 

Now suppose that F ^ This implies that the algorithm decided not to output F 

because it has determined that there are at least {d{k — 1) -|- l)'^“l^l supersets of some set 
C C F in F-. Furthermore, by the invariant we have that there are at most {d{k — 1) -|- I 

supersets of C in F[ for any C' that is larger than C. Consequently, by Lemma 3 we have 
that F[ contains C or a {d(fk — 1) -|- l)-flower with core C, i.e., any hitting set for F[ of size at 
most d{k — 1) for F' must hit C. We show that there is a j-packing in F■_^_l that avoids S: We 
know that F \ {F} is a (j — l)-packing in Ft that avoids 5 U F. By the induction hypothesis, 
there is a (j — l)-packing F" in that also avoids S U F. By assumption, F” and S both 
avoid C F F and therefore they also avoid at least one set F' in F^ since together they contain 
at most d{k — 1) elements. Thus, we can obtain the packing F' = F" U {F} of size j which 
avoids S. □ 

The following theorem is a consequence of Observation 5, Lemma 14 and Lemma 15. 

Theorem 7. d-SET packing(/c) admits a linear-time kernelization which returns an equivalent 
instance with at most {d{k — 1) -|- 1)^ sets. 

10 Concluding remarks 

In this paper we have presented logspace kernelization algorithms for d-HiTTiNG SEt(/c) and 
d-SET pagking(A:). We have shown how these can be used to obtain logspace kernels for hitting 
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and packing problems on graphs, and have given a logspace kernelization for edge dominating 
SEt(A:). By using flowers instead of sunflowers we save a large hidden constant in the size of 
these kernels. Furthermore, we have improved upon a linear-time kernel for d-HiTTiNG SEt(A:) 
and have given a linear-time kernel for d-SET packing(/c). One question to settle would be 
whether a vertex-linear kernel for vertex COVEr(A:) can be found in logspace. While known 
procedures for obtaining a vertex-linear kernel seem unsuitable for adaptation to logspace, we 
currently also do not have the tools for ruling out such a kernel. The problem of finding a 
vertex-linear kernel for vertex COVEr(/c) in linear time also remains open. 
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A Finding a maximum (sun)flowers 

A.l NP-hardness proof for finding a snnfiower of size 1. 

Proof. Reduction from d-SET packing, d > 3. Suppose {U, T, k) is an instance of d-SET pack¬ 
ing. We obtain an instance for sunflower as follows. Let m = |J^|. Make disjoint 

copies Ti,..., Tm+i of T such that Ti has subsets over finite set \Ji. Let C/' = C/i U ... U Um+i, 
let U ... U Tm+i, and let I = k{m + 1). We show that T has a set packing V of size 

k if and only if contains a sunflower of with I petals. First, suppose P" has a packing V of 
size k. Then we obtain a sunflower with k{m + 1) petals and an empty core by taking k sets 
corresponding to V in each Pi. For the converse direction, suppose that P' has a sunflower with 
k{m + 1) petals and core C. Since each family Pi contains at most m sets we know that there 
must be at least one petal in Pi and another in Pj for some i ^ j. Thus, (7 = 0. Furthermore, 
since there are 1) petals, there must be some i such that at least k petals are in Pi. These 

k petals form a packing since they must be completely disjoint. □ 

A.2 coNP-hardness proof for finding a flower of size 1. 

Proof. Reduction from not d-HiTTiNG set. Suppose {U, P, k) is an instance of not d-HiTTiNG 
SET. We obtain an instance {U',P',l) for /c-flower as follows. Let m = |J-'|. Make disjoint 
copies Pi, ..., Pm+i of P such that Pi has subsets over finite set Ui. Let C/' = (7i U ... U Um+i, 
let P' = PiU.. .UPm+ii and let I = {k + l){m + l). We show that P does not have a hitting set 
S of size k if and only if P’ contains a (-flower. First, suppose P does not have a /c-hitting set S. 
Therefore, there must be some collection of sets in P that requires at least k + 1 elements to hit. 
By taking copies of these sets from each Pi and choosing (7 = 0 we obtain a (/c-|-l)(m-|-l)-flower. 
For the converse direction, suppose that P' has a {k + l){m + l)-flower with core C. Since each 
family Pi contains at most m sets we know that this {k -|- l)(m -|- l)-flower must contain some 
sets from Pi and Pj such that i ^ j. Thus, (7 = 0. Furthermore, since by definition we have 
that {k + l)(m -|- 1) elements are required to hit P', we know that there must be some Pi where 
we need at least k + 1 elements for a hitting set. □ 

A.3 NP-hardness proof for finding a snnfiower with I petals and core CPF 
for a specific set F. 

Proof. Reduction from d-SET packing, d > 3. Given an instance {U,P, k), we add a set F that 
is disjoint from any set in P and ask if P contains a sunflower with k petals and core CPF. 
We show that this is the case if and only if P has a set packing of size k. Suppose that P has a 
sunflower with k petals and core CPF. By construction, (7 = 0 since F is completely disjoint 
from P, i.e., the sunflower is a set packing of size k. For the converse direction suppose that P 
contains a A:-set packing V. Thus, V is also a sunflower with core C = % P F . □ 
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